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60! Abstract 

■ Schoning [7] presents a simple randomized algorithm for fc-SAT with running time 

O(o£poly(n)) for = 2(k — l)/k. We give a deterministic version of this algorithm 
running in time 0{{ak + e)™poly(n)), where e > can be made arbitrarily small. 
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C/5 ■ 1 Introduction 

q 

' In 1999, Uwe Schoning [7] gave an extremely simple randomized algorithm for A;-SAT. 

Ten years on, the fastest algorithms for A;-SAT are only slightly faster than his, and far 

more complicated. His algorithm works as follows: Let F be a (< A:)-CNF formula over 

n variables. Start with a random truth assignment. If this does not satisfy F, pick an 

■ arbitrary unsatisfied clause C. From C, pick a literal uniformly at random, and change 

the truth value of its underlying variable, thus satisfying C. Repeat this reassignment 

step 0{n) times. If F is satisfiable, this finds a satisfying assignment with probability at 

• 1 least 
00 : / k 

o U*-i) 

By repetition, this gives a randomized 0*(1.334 n ) algorithm for 3-SAT, an 0*(1.5 n ) for 
4-SAT, and so on (we use O* to suppress polynomial factors in n). Shortly after Schoning 
^ ■ published his algorithm, Dantsin, Goerdt, Hirsch, Kannan, Kleinberg, Papadimitriou, 

Raghavan and Schoning [2] (henceforth Dantsin et al. for the sake of brevity) came up 
with a deterministic algorithm that can be seen as an attempt to derandomize Schoning's 
algorithm. We say attempt because its running time is 0*{{2k/{k + l)) n ), which is ex- 
ponentially slower than Schoning's. For example, this gives an 0*(1.5 n ) algorithm for 
3-SAT and 0*(1.6 n ) for 4-SAT. Subsequent papers have improved upon this running 
time, mainly focusing on 3-SAT: Dantsin et al. already improve the running time for 
3-SAT to 0(1.481 n ), Brueggemann and Kern [Ij to 0(1.473"), Scheder [B] to 0(1.465 n ), 
and Kutzkov and Scheder [4] to 0*(1.439 n ). All improvements suffer from two drawbacks: 
First, they fall short of achieving the running time of Schoning's randomized algorithm, 
and second, they are all fairly complicated. In this paper, we give a rather simple deter- 
ministic algorithm with a running time that comes arbitrarily close to Schoning's, thus 
completely derandomizing his algorithm. We also show how to derandomize Schoning's 
algorithm for constraint satisfaction problems, which are a generalization of SAT, allowing 
more than two truth values. 
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1.1 Notation 



We use the notational framework introduced in [SJ. We assume an infinite supply of 
propositional variables. A literal u is a variable x or a complemented variable x. A finite 
set C of literals over pairwise distinct variables is called a clause and a finite set of clauses 
is a formula in CNF (Conjunctive Normal Form). We say that a variable x occurs in a 
clause C if either x or x are contained in it and that x occurs in the formula F if there is 
any clause where it occurs. We write vbl(C) or vbl(-F) to denote the set of variables that 
occur in C or in F, respectively. We say that F is a (< k)- CNF formula if every clause 
has size at most k. Let such an F be given and write V := vbl(F). 

A (truth) assignment is a function a : V — > {0,1} which assigns a Boolean value to 
each variable. A literal u = x (or u = x) is satisfied by a if a{x) = 1 (or a(x) = 0). A 
clause is satisfied by a if it contains a satisfied literal and a formula is satisfied by a if all 
of its clauses are. A formula is satisfiable if there exists a satisfying truth assignment to 
its variables. 

If a and f3 are two truth assignments over a set V of variables, then their (Hamming) 
distance d#(a,/3) is defined to be the number of variables x £ V where a(x) / /3(x), 
i.e. dff(a,/3) := \{x £ V \ a(x) / (3(x)}\. For a given assignment a, we denote the set 
of all assignments (5 with Hamming distance at most r from a by -B r (a) := {/3 : V — > 
{0, 1} | dji(a,/3) < r} and call this the Hamming ball of radius r centered at a. 

Formulas can be manipulated by permanently assigning values to variables. If F is a 
given CNF formula and x G vbl(.F) then assigning ml satisfies all clauses containing 
x (irrespective of what values the other variables in those closes are possibly assigned 
later) whilst it truncates all clauses containing x to their remaining literals. We will write 
pw-=i\ t Q denote the formula arising from doing just this, or equally F^- u:=1 ^ where u is 
a literal and we mean to assign the underlying variable the value necessary to satisfy u. 
If /3 is a partial assignment, i.e., defined on a subset of vb\(F), then F^ denotes the 
formula we obtain from F by permanently setting the variables from those subset to their 
respective values under f3. 

1.2 Previous Work 

Both Schoning's algorithm and its deterministic versions can be seen as not attacking 
SAT directly, but rather a parametrized local search problem: 

Promise-Ball-/c-SAT: Given a (< /c)-CNF formula F over n variables, 
an assignment a to these variables, a natural number r, and the promise that 
the Hamming ball B r (a) contains a satisfying assignment. Find any satisfying 
assignment to F. 

Let us clarify what we mean by saying "Algorithm A solves Promise-Ball-/c-SAT": 
If F, a, and r are as described above, i.e., if B r (a) contains a satisfying assignment, 
then A must return some satisfying assignment. We do not require this assignment 
to lie in B r (a), however. On the other hand, if F is unsatisfiable, or B r (a) contains 
no satisfying assignment, the behavior is unspecified. Of course, since we can quickly 
check any purported assignment that the algorithm outputs, we can assume the algorithm 
always either returns a satisfying assignment or failure. 

Schoning's original randomized algorithm for Promise-Ball-Zc-SAT as described in 
the introductory section, henceforth called Schoning, repeatedly selects any clause un- 
satisfied under a, then randomly picks a literal from that clause and flips the underlying 
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variable's value. The algorithm gives up if a satisfying assignment has not been encoun- 
tered by the time 0(n) steps have been performed (it is well-known and easy to check 
that n/(k — 2) correction steps are sufficient to achieve optimal efficiency). 

Lemma 1 (Schoning [7]). Let F be a (< k)-CNF formula, a a truth assignment to its 
variables, and r E N. If there is a satisfying assignment in B r {a), then with probability 
at least (k — 1)~ r , Schoning returns a satisfying assignment. By repetition, this gives a 
Monte-Carlo algorithm for Promise-Ball-Zc-SAT with running time 0*{{k — l) r ). 

Schoning turns this lemma into an algorithm for /c-SAT by choosing the assignment 
a uniformly at random from all 2 n truth assignments: 

Theorem 2 (Schoning |7J). There is a randomized algorithm that runs in polynomial 
time and finds a satisfying assignment of F with probability 

provided F is satisfiable. 

Proof. Let a* be a satisfying assignment of F and let a be an assignment chosen uni- 
formly at random from {0, 1}™. For each < r < n, the probability that the Hamming 
distance dn(a, a*) is r is (™) /2 n . In this case, Schoning's random walk returns a satisfying 
assignment with probability at least (k — l)~ r . The overall success probability thus is at 
least 




and the running time is clearly polynomial. □ 

By repeating the above algorithm, one obtains a Monte-Carlo algorithm for A;-SAT of 
running time 0*((2(k - l)/k) n ). 



Deterministic Algorithms 

What about deterministic algorithms? Dantsin et al. [2] give a simple recursive algo- 
rithm for Promise-Ball-A:-SAT running in time 0*(k r ): If a satisfies F, we are done. 
Otherwise, if r = 0, we can return failure. If r > 1 and a does not satisfy F, we let 
C be an unsatisfied clause. There are at most k literals in C, thus there are at most k 
possibilities to locally change a so as to satisfy C. We recursively explore each possibility, 
decreasing r by 1 (see Algorithm [T] for the details). The next achievement of Dantsin et 
al. is to show how a deterministic algorithm for Promise-Ball-/c-SAT can be turned 
into a deterministic algorithm for fe-SAT: 

Lemma 3 (Dantsin et al. [2J). If algorithm A solves Promise-Ball-^-SAT in time 
0*(a r ), then there is an algorithm B solving k-SAT in time O* f^^pH ^- Furthermore, 
B is deterministic if A is. 

Their algorithm to prove the lemma constructs a so-called covering code C C {0, l} n 
with the property that every assignment a £ {0, 1}" has a codeword 7 G C at a suit- 
ably small Hamming distance from a. Schoning's randomized selection of an initial as- 
signment is turned deterministic by iterating through all codewords 7 E C and solving 
Promise-Ball-/c-SAT around each of them. Provided that the formula is satisfiable, 



3 



one choice of 7 S C will be sufficiently close to a satisfying assignment for the subsequent 
local search to succeed. 

The recursive algorithm for Promise-Ball-£;-SAT of Dantsin et al. has running time 
0*(k r ). Therefore Lemma [3] gives a running time of 0*((2k/(k + l)) n ). For k = 3, clever 
branching rules have been designed to improve upon the 0*(3 r ) bound, leading to the 
respective improvements on deterministic running times mentioned in the first paragraph 
of this paper. 

1.3 Our Contribution 

Our contribution is to give a deterministic algorithm solving Promise-Ball-/c-SAT in a 
running time that gets arbitrarily close to that of the Monte-Carlo algorithm in Lemma [TJ 

Theorem 4. For every e > 0, there exists a deterministic algorithm which solves the 
problem Promise-Ball-A;-SAT in time 0((k — 1 + e) r ). 

Combining this theorem with Lemma [3] proves our main theorem: 

Theorem 5. For every e > 0, there is a deterministic algorithm solving k-SAT in time 



Before jumping into technical details, let us sketch the main idea of our improvement 
for k = 3. Let F be a 3-CNF formula and a some assignment. Suppose F contains t 
pairwise disjoint clauses C\, . . . , Ct, all of which are unsatisfied by a. We let Schdning's 
random walk algorithm process these clauses one after the other: In each clause Cj, it 
picks one literal randomly and satisfies it. Thus, of all 3* possibilities to choose one literal 
in each Cj, it chooses one uniformly at random. Let a* be an assignment satisfying F. 
With probability at least 3~*, Schdning's random walk chooses in each C{ a literal that 
a* satisfies. In this case, the distance from a to a* decreases by t. However, with much 
bigger probability, roughly 2~*/ 3 , the random walk chooses the "correct" literal in 2i/3 
clauses C% and a "wrong" literal in the remaining t/3. In this case, the distance from 
a to a* decreases by t/3. This is the power of Schdning's algorithm: It hopes to make 
a modest progress of t/3, which is much more likely than making a progress of t. Our 
key observation is that this choice of Schoning can be derandomized: There is a set of 
(roughly) 2*/ 3 choices which literal to satisfy in each Cj, such that at least one of them 
makes a progress of at least t/3. 



To begin with, we will formally state the recursive algorithm by Dantsin et al. [2] solving 
Promise-Ball-A;-SAT in time 0*{k r ). 

Proposition 6. Algorithm^ solves Ball-/c-SAT in time 0*{k r ) 

Proof. The running time is easy to analyze: If F is a (< A;)-CNF formula, then each call 
to searchball causes at most k recursive calls. To see correctness of the algorithm, we 
proceed by induction on r and suppose that a* satisfies F and df/(a,a*) < r. Let C 
be the clause selected in line El Since a* satisfies C but a does not, there is at least 
one literal u £ C such that a*(u) = 1 and a(u) = 0. Let a' := a*[u := 0]. We observe 
that d(a, a') < r — 1 and a' satisfies F^- u:=1 ^ (although not necessarily F). Therefore 
the induction hypothesis ensures that the recursive call to searchball(F[ u:=1 ],a,r - 1) 
return true. □ 




2 The Algorithm 
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Algorithm 1 searchball(CNF formula F, assignment a, radius r) 

1: if a satisfies F then 

2: return true 

3: else if r = then 

4: return false 

5: else 

6: C «— any clause of F unsatisfied by a 

7: for u G C do 

8: if searchball(F[ M:=1 ], a, r - 1) = true then 

9: return true 

10: end if 

11: end for 

12: return false 

13: end if 



Proposition 7. Suppose F is a (< k)- CNF formula, a a truth assignment to its variables, 
and r G N. // every clause in F that is unsatisfied by a has size at most k — 1, then 
searchball(F, a, r) runs in time 0*{(k — l) r ). 

Proof. The key observation is that if all clauses in F that are not satisfied by a have at 
most k — 1 literals, then the same is true for any formula of the form F^- U]=1 \ Therefore, 
any call to searchball entails at most k — 1 recursive calls. □ 

2.1 A>ary Covering Codes 

Before explaining our algorithm, we make a combinatorial detour to /c-ary covering codes, 
which will play a crucial role in our algorithm. 

The set looks similar to the Boolean cube {0,1}* in many ways. For 

example, it is endowed with a Hamming distance dn- For two elements w, w' £ {1, . . . , A;}*, 
we define dH(w,w') to be the number of coordinates in which w and w' do not agree. 
There are also balls: We define 

B&Xw) := {w' € {l,...,k¥ | d H (w,w') < r} . 

What is the volume of such a ball? Well, there are (') possibilities to choose the set of 
coordinates in which w and w' are supposed to differ, and for each such coordinate, there 
are k — 1 ways in which they can differ. Therefore, 

volW(t,r) := \B^(w)\ = Q (jfe - l) r . 

We are interested in the question how many balls B^\w) we need to cover all of {1, ... , k} f . 
Note that by symmetry, w 6 Br k \v) iff v G Br k \w) for any v, w G {1, . . . , A;}*. 

Definition 8. Zei i G N. A sei C C {1, . . . , k} f is called a code of covering radius r if 

\jB< k \w) = il,...,k¥. 

wee 

In other words, for each w' G {1, . . . , k} n , there is some w G C such that dn(w, w') < r. 
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The following lemma is an adaptation of a lemma by Dantsin et al. [2J, only for 
{1, . . . , k} 1 instead of the Boolean cube {0, 1}*. 

Lemma 9. For any t,k £ N and < r < t, there exists a code C C {1, . . . , k} 1 of covering 
radius r such that 

tln(fc)/c* 



\C\< 



Proof. The proof is probabilistic. Let m := [~(tln(fc)/c*)/((*)(/c — l) r )] and build C by 
sampling m points from {1, ...,k}, uniformly at random and independently. Fix an 
element w' G {1, . . . , k} 1 . We calculate 



PrK \J 4 fc) H] = 1 



vo 



l( fc )(t/, 



wee 



< e 



-|C|vol( & ) (t,r)/fc* < e -tln(fc) _ >-* 



By the union bound, the probability that there is any w' LLec Br k \w) is at most /c* 
times the above expression, and thus smaller than 1. Therefore, with positive probability, 
C is a code of covering radius r. □ 

2.2 A Deterministic Algorithm for Promise-Ball-A>SAT 

We will now describe our deterministic algorithm. First it chooses a sufficiently large 
constant t, depending on the e in Theorem HJ and computes a code C C {1, . . . , k} 1 of 
covering radius t/k. Since k and t are constants, it can afford to compute an optimal 
such code. We estimate its size using Lemma [9] and the following approximation of the 
binomial coefficient: 

Proposition 10 (MacWilliams, Sloane [5], Chapter 10, Corollary 9). For < p < 1/2 

and t G N, it holds that 



> 



1 



P) 



#(i - p) 

We apply this bound with p = 1/k: 

t\ 1 hi. f k ^ ( k -W k 



1 



(l-p)t 



> 



k t/k 



k f 



Together with Lemma we obtain, for t sufficiently large: 



C < 



iln(fc)fc* 



G; fc )(^-i) t/fc 



< 



jfc*(fc - 1)*/* 



< ^ 



The algorithm computes this constant-size code and stores it for further use. It then 
calls a recursive procedure that does the real stuff. That procedure first greedily con- 
structs a maximal set G of pairwise disjoint unsatisfied /c-clauses of F. That is, G = 
{Ci, C*2, . . . , C m }, the Ci are pairwise disjoint, each C, in G is unsatisfied by a, and each 
unsatisfied /c-clause D in F shares at least one literal with some Cj. 

At this point, the algorithm considers two cases. First, if m < t, it enumerates 
all 2 km truth assignments to the variables in G. For each such assignment /3, it calls 
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searchball(i ? ^l, a, r) and returns true if at least one such call returns true. Correct- 
ness is easy to see: At least one (3 agrees with the promised assignment a*, and therefore 
a* still satisfies To analyze the running time, observe that for any such /3, the for- 

mula .F^l contains no unsatisfied clause of size k. This follows from the maximality of G. 
Therefore, Proposition [7] tells us that searchball(i ? [^ , a, r) runs in time 0*((k — l) r ), 
and therefore this case takes time 2 km O*((k — l) r ). Since m < t, and t is a constant, this 
is 0*((k- l) r ). 

The second case is more interesting: If m > t, the algorithm chooses t clauses from 
G to form H = {Ci, . . . ,Cf}, a set of pairwise disjoint fc-clauses, all unsatisfied by a. 
At this point, our code will come into play, but first we introduce some notation: For 
w 6 {1, . . . , fc}*, let a[w] be the assignment obtained from a by flipping the value of the 
tUj th literal in Cj, for 1 < i < t. To do this, the algorithm has to choose a fixed but 
arbitrary ordering on H as well as on the literals in each C{. Note that a[w] satisfies 
exactly one literal in each Cj, for 1 < i < t. Strictly speaking a[w] depends not only on 
w, but also on H, so we should write a[H,w] instead of a[w\. However, as long as H is 
understood, we write a[w]. 

Let us give an example. Suppose a is the ail-O-assignment, t = 3 and H = {{x\ V y\ V 
zi), (x2 V y2 V Z2), (X3 V 2/3 V £3)}. Let w = (2, 3, 3). Then a[w] is the assignment that sets 
y±, Z2, and 2:3 to 1 and all other variables to 0. 

Consider now the promised satisfying assignment a* with dn^a, a*) < r. We define 
w* € {1, ... , kY as follows: For each 1 < i < t, we set w* to j such that a* satisfies the 
j th literal in Cj. Since a* satisfies at least one literal in each Cj, we can do this, but since 
a* possibly satisfies multiple literals in Cj, the choice of w* is not unique. Note that in 
any case d(a[t/;*], a*) = d(a, a*) — t < r — t. 

We could now iterate over all w £ {1, . . . , k} 1 and call searchball(i ? , a[w], r — t). This 
would essentially be what searchball does and would yield a running time of 0*(k r ), 
i.e., no improvement over Dantsin et al. Therefore, we do not do this. Instead, we let 
our code C play its crucial role: Rather than recursing on a[w] for each w 6 {1, . . . , A;}*, 
we recurse only for each w £ C. By the properties of C, there is some w' £ C such that 
df{{w' ,w*) = t/k. Observe what happens when we go from a to a[w/]: For at most t/k 
coordinates i, we have «4 7^ w*. For those coordinates, switching the w'^ literal of C{ 
in the assignment a increases the distance to a*. On the other hand, there are at least 
t — t/k coordinates i where = w*, and switching the w'^ 1 literal of Cj for such an i 
decreases the distance to a*. We conclude that the distance increases at most t/k times 
and decreases at least t — t/k times. Therefore 

d H (a[w'],a*) < d H (a, a*) + t/k - (t - t/k) < r - (t - 2t/k). 

Writing A := (t — 2t/k), the procedure calls itself recursively with a[w] and r — A for 
each w G C and at least one call will be successful. Let us analyze the running time: We 
cause \C\ recursive calls and decrease the complexity parameter r by A in each step. This 
is good, since \C\ is only slightly bigger than (k — 1) A . We conclude that the number of 
leaves in this recursion tree is at most 

ld'/ A < ( t \k - i) A y/ A = ((it - i)t 2 / A ) r . 
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bmce t 2 / A goes to 1 as t grows, the above term is, for sufficiently large t, bounded by 
{k — 1 + e) r . This proves Theorem [H We summarize the whole procedure in Algorithm [2j 



Algorithm 2 searchball-f ast(/c £ N, (< A;)-CNF formula F, assignment a, radius r, 
codeC C {!,..., A;}*) 



1: if a satisfies F then 
2: return true 
3: else if r = then 
4: return false 
5: else 

6: G a maximal set of pairwise disjoint /c-clauses of F unsatisfied by a 
7: if \G\ < t then 

8: for each assignment (3 to the variables in G do 

9: if searchball(.F^, a, r) = true then 

10: return true 

11: end if 

12: end for 
13: else 

14: £T«-{Ci,...,C t }CG 
15: for w G C do 

16: if searchball-fast(F, a[H, w],r — (t— 2t/k)) = true,C then 

17: return true 

18: end if 

19: end for 

20: end if 

21: end if 

22: return false 



3 Constraint Satisfaction Problems 

Constraint Satisfaction Problems, short CSPs, are generalizations of SAT, allowing more 
than two truth values. Formally, suppose there is a set V = {x±, . . . ,x n } of n variables, 
each of which can take on a value in {1, . . . ,d}. A literal is an expression of the form 
(xi / c) for c G {1, . . . , d}. A constraint is a disjunction of literals, for example 

(sci ^ 7 V x 2 / 5 V x 3 / d) . 

A CSP formula finally is a conjunction of constraints. We call it a (d, < A;)-CSP formula if 
its variables can take d values and each constraint has at most k literals. An assignment a 
to the variables V is a function a : V — > {1, . . • , d} and can be represented as an element 
from {1, . . . ,d} n . We say a satisfies the literal (xj 7^ c) if, well, a(xi) / c. It satisfies a 
constraint if it satisfies at least one literal in it, and it satisfies a CSP formula if it satisfies 
each constraint in it. Finally, (d, < A;)-CSP is the problem of deciding whether a given 
(d,< A;)-CSP formula has a satisfying assignment. Note that (2,/c)-CSP is the same as 
A;-SAT. Also, (d, < A;)-CSP is NP-complete except the following three cases: (i) d = 1, (ii) 
k = 1, (iii) d = k = 2. Cases (i) and (ii) are trivial problems, and (iii) is 2-SAT, which is 
solvable in polynomial time (well-known, not difficult to show, but still not trivial). 
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For the cases where (d, < k)-CSP is NP-complete, what can we do? Iterating through 
all d n assignments constitutes an algorithm solving (d, k)-CSP in time 0*(d n ). Schoning's 
algorithm [7J is much faster: 

Theorem 11 (Schoning [7]). There is a randomized Monte-Carlo algorithm solving (d, < 
k)-CSP in time 

Again, for d = 2 this is the running time of Schoning for A;-SAT. In his original 
paper [7], Schoning describes how his algorithm seamlessly generalizes to arbitrary domain 
sizes d > 2: in each correction step, after a variable to reassign has been selected at 
random, another random choice is made among the d — 1 values it may be changed to. 
The subsequent analysis in [7] also extends to this case. 

However, there is a more direct way to reduce the (d, < /c)-CSP for d > 2 to the 
Boolean problem which is then able to use any fc-SAT algorithm as a black box: we 
simply select for each variable, uniformly at random and independently from the other 
variables, 2 out of the d possible values in the domain. Any satisfying assignment survives 
this restriction with probability exactly {2/d) n and thus any /c-SAT algorithm with success 
probability p n generalizes to a (d, < fc)-CSP algorithm with success probability (2p/d) n . 
When plugging in Schoning for /c-SAT, we obtain Theorem II 11 

In order to generalize our deterministic variant to arbitrary domain sizes, we will 
choose the simple route and derandomize the aforementioned reduction instead of trying 
to rework the whole analysis from the previous section, with the additional advantage 
that the result scales for any further improvement on the running time for deterministic 
fc-SAT. 

Theorem 12. There exists a deterministic algorithm having running time 0*{{d/2) n ) 
which takes any (d,< k)-CSP F over n variables and produces I = 0*{{d/2) n ) Boolean 
(< k)- CNF formulas {Gj}i<j<; such that F is satisfiable if and only if there exists some 
i such that Gi is satisfiable. 

Using the fc-SAT algorithm we developed in the previous section, we then immediately 
get the derandomization of Theorem [TTJ 

Corollary 13. For every e > 0, there is a deterministic algorithm solving (d,< k)-CSP 
in time 

Proof of Theorem\lSl We start with a useful definition. A 2-box in {1, . . . ,n} d is a set 
of the form B := Pi x • • ■ x P n , where Pi C {1, . . . , d} and \P{\ = 2. A 2-box can be 
seen as a subcube of {1, . . . ,d} n of side length 2 and full dimension. A random 2-box is 
a 2-box sampled uniformly at random from all 2-boxes in {1, ... , d} n : This can be done 
by sampling each Pi independently, uniformly at random from all (jf) pairs in {1, . . . , d}. 
As mentioned above, the probability that any fixed satisfying assignment of F lies within 
a random 2-box is (2/d) n . 

In order to derandomize this technique, we need to deterministically cover {1, . . . , d} n 
with 2-boxes, in a fashion very similar to the covering codes used by Dantsin et al. [2]: 
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Lemma 14. Let d, n G N. There is a set B of 2-boxes in {1, ... , c?} n suc/i that 

U s = {i,...,4 n 

£>eS 

and 

f d\ n 
\B\ < I - ) poly(n) . 



n 2, 

Furthermore, B can be constructed in time 0(\B\). 

Given this lemma, our algorithm is complete: It first constructs such a suitably small 
set B of 2-boxes, and then, for each 2-box Pi x • • • x P n = B £ B, outputs a (< /c)-CNF 
formula arising from F by restricting the domain of the i th variable to the values in Pj. 
This finishes the proof of the theorem. □ 

It remains to prove the lemma. 

Proof of Lemma [T4\ Note that if d is an even number, the proof is easy. For 1 < j < d/2, 
define = {2j — 1, 2j}. Each element w & {1, ... , d/2} n defines the 2-box 



B w ■= pM x • • • x P«) 

and clearly 

U B w = {l,...,d} n . 

we{l,...,d/2} n 

The difficulty arises if d is odd. As Dantsin et al. [2], we first show the existence of a 
suitable set of 2-boxes, and then use a block construction and an approximation algorithm 
to obtain a construction. 

Lemma 15. For any n,d G N, there is a set B of 2-boxes such that \B\ < \n ln(d)(d/2) n ] 
such that Usee = {!>••• > d} n . 

Proof. The proof works exactly like the proof of Lemma [9J We sample \n ln(d)(<i/2) n ] 
many 2-boxes independently, uniformly at random and show that with positive probabil- 
ity, the resulting set has the desired properties. □ 

To prove Lemma [TH we have to derandomize the probabilistic argument we have just 
seen. For this, we choose a sufficiently large constant b, set n' := n/b and construct an 
instance of Set- Cover: The ground set is {1, . . . , d} n and the sets are all 2-boxes therein, 

of which there are (2) < d 2n ' . We know from Lemma [15] that there is a cover of 2-boxes 
of size \nln(d)(d/2) n ~\. There is a greedy algorithm for Set-Cover (see Hochbaum |3j 
for example) achieving an approximation ratio of O(logiV), where iV is the size of the 
ground set. Since in our case logiV = log(<i n ) = O(n'), this algorithm will give us a set 
B of 2-boxes covering {1, . . . , d} n of size 



r\2 




\B\ € O (n 



How much time do we need to construct Bl The greedy algorithm is polynomial in the 
size of its instance, which is 0(d? n ), thus it takes time 0(d 2Cn ) for some constant C . 
By choosing b large enough, we can make sure that d 2Cn = d 2Cn l h is smaller than the 
running time we are aiming at. Finally, we obtain a set of 2-boxes in {1, ... ,d} n by 
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"concatenating" the boxes in B: We identify a tuple (B±, . . . ,Bf,) G B b with the 2-box 
B\ x • • • x Bt,, and therefore B b is a set of 2-boxes covering {1, . . . , n}, and 



This is a set of 2-boxes covering {1, ...,d} n of the desired size, finishing the proof of 
Lemma HH □ 
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